智能论文笔记

UNO-QA: An Unsupervised Anomaly-Aware Framework with Test-Time Clustering for OCTA Image Quality Assessment

Juntao Chen , Li Lin , Pujin Cheng , Yijin Huang , Xiaoying Tang

分类：计算机视觉

2022-12-20

Medical image quality assessment (MIQA) is a vital prerequisite in various medical image analysis applications. Most existing MIQA algorithms are fully supervised that request a large amount of annotated data. However, annotating medical images is time-consuming and labor-intensive. In this paper, we propose an unsupervised anomaly-aware framework with test-time clustering for optical coherence tomography angiography (OCTA) image quality assessment in a setting wherein only a set of high-quality samples are accessible in the training phase. Specifically, a feature-embedding-based low-quality representation module is proposed to quantify the quality of OCTA images and then to discriminate between outstanding quality and non-outstanding quality. Within the non-outstanding quality class, to further distinguish gradable images from ungradable ones, we perform dimension reduction and clustering of multi-scale image features extracted by the trained OCTA quality representation network. Extensive experiments are conducted on one publicly accessible dataset sOCTA-3*3-10k, with superiority of our proposed framework being successfully established.

translated by 谷歌翻译

YoloCurvSeg: You Only Label One Noisy Skeleton for Vessel-style Curvilinear Structure Segmentation

Li Lin , Linkai Peng , Huaqing He , Pujin Cheng , Jiewei Wu , Kenneth K. Y. Wong , Xiaoying Tang

分类：计算机视觉

2022-12-11

Weakly-supervised learning (WSL) has been proposed to alleviate the conflict between data annotation cost and model performance through employing sparsely-grained (i.e., point-, box-, scribble-wise) supervision and has shown promising performance, particularly in the image segmentation field. However, it is still a very challenging problem due to the limited supervision, especially when only a small number of labeled samples are available. Additionally, almost all existing WSL segmentation methods are designed for star-convex structures which are very different from curvilinear structures such as vessels and nerves. In this paper, we propose a novel sparsely annotated segmentation framework for curvilinear structures, named YoloCurvSeg, based on image synthesis. A background generator delivers image backgrounds that closely match real distributions through inpainting dilated skeletons. The extracted backgrounds are then combined with randomly emulated curves generated by a Space Colonization Algorithm-based foreground generator and through a multilayer patch-wise contrastive learning synthesizer. In this way, a synthetic dataset with both images and curve segmentation labels is obtained, at the cost of only one or a few noisy skeleton annotations. Finally, a segmenter is trained with the generated dataset and possibly an unlabeled dataset. The proposed YoloCurvSeg is evaluated on four publicly available datasets (OCTA500, CORN, DRIVE and CHASEDB1) and the results show that YoloCurvSeg outperforms state-of-the-art WSL segmentation methods by large margins. With only one noisy skeleton annotation (respectively 0.14%, 0.02%, 1.4%, and 0.65% of the full annotation), YoloCurvSeg achieves more than 97% of the fully-supervised performance on each dataset. Code and datasets will be released at https://github.com/llmir/YoloCurvSeg.

translated by 谷歌翻译

Diversity Boosted Learning for Domain Generalization with Large Number of Domains

Xi Leng , Xiaoying Tang , Yatao Bian

分类：机器学习

2022-07-28

机器学习算法使平均训练损失最小化通常由于训练数据之间相关性的贪婪开发而遭受泛化性能差，而训练数据在分配变化下并不稳定。它启发了各种域泛化作品（DG），其中一系列方法（例如因果匹配和鱼类）通过成对域操作来工作。他们需要$ o（n^2）$成对域操作，其中$ n $域通常都很昂贵。此外，尽管DG文献中的一个共同目标是学习针对域引起的虚假相关性的不变表示，但我们强调了减轻对象引起的伪造相关性的重要性。基于观察到多样性有助于减轻虚假相关性的观察，我们提出了利用确定点过程（DPP）的多样性增强了两级抽样框架（DOMI），以有效地在大量域中进行最有用的信息。我们表明，DOMI帮助训练强大的模型，以抵抗来自域侧和对象端的虚假相关性，从而大大提高了旋转的MNIST，旋转的时尚MNIST和IWILDCAM数据集对主链DG算法的性能。

translated by 谷歌翻译

AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation

Junyan Lyu , Yiqi Zhang , Yijin Huang , Li Lin , Pujin Cheng , Xiaoying Tang

分类：计算机视觉

2022-07-27

卷积神经网络已广泛应用于医学图像分割，并取得了相当大的性能。但是，性能可能会受到训练数据（源域）和测试数据（目标域）之间域间隙的显着影响。为了解决此问题，我们提出了一种基于数据操作的域泛化方法，称为域概括（AADG）的自动增强。我们的AADG框架可以有效地采样数据增强策略，从而产生新的领域并从适当的搜索空间中多样化训练集。具体而言，我们介绍了一项新的代理任务，以最大程度地提高了多个增强新颖的域之间的多样性，该域通过单位球体空间中的凹痕距离来衡量，从而使自动化的增强可牵引。对抗性训练和深入的强化学习有效地搜索了目标。全面执行了11个公开底部的底面图像数据集的定量和定性实验（四个用于视网膜血管分割，四个用于视盘和杯子和杯（OD/OC）分割（OD/OC）分割，视网膜病变细分进行了三个）。两个用于视网膜脉管系统分割的八八个数据集进一步涉及验证跨模式泛化。我们提出的AADG通过视网膜船，OD/OC和病变细分任务的相当大的利润来表现出最新的概括性能，并优于现有方法。学到的政策在经验上得到了证实为模型不平衡，并且可以很好地转移到其他模型中。源代码可在https://github.com/crazorback/aadg上找到。

translated by 谷歌翻译

GAMMA Challenge:Glaucoma grAding from Multi-Modality imAges

Junde Wu , Huihui Fang , Fei Li , Huazhu Fu , Fengbin Lin , Jiongcheng Li , Lexing Huang , Qinji Yu , Sifan Song , Xinxing Xu

分类：计算机视觉

2022-02-14

Color fundus photography and Optical Coherence Tomography (OCT) are the two most cost-effective tools for glaucoma screening. Both two modalities of images have prominent biomarkers to indicate glaucoma suspected. Clinically, it is often recommended to take both of the screenings for a more accurate and reliable diagnosis. However, although numerous algorithms are proposed based on fundus images or OCT volumes in computer-aided diagnosis, there are still few methods leveraging both of the modalities for the glaucoma assessment. Inspired by the success of Retinal Fundus Glaucoma Challenge (REFUGE) we held previously, we set up the Glaucoma grAding from Multi-Modality imAges (GAMMA) Challenge to encourage the development of fundus \& OCT-based glaucoma grading. The primary task of the challenge is to grade glaucoma from both the 2D fundus images and 3D OCT scanning volumes. As part of GAMMA, we have publicly released a glaucoma annotated dataset with both 2D fundus color photography and 3D OCT volumes, which is the first multi-modality dataset for glaucoma grading. In addition, an evaluation framework is also established to evaluate the performance of the submitted methods. During the challenge, 1272 results were submitted, and finally, top-10 teams were selected to the final stage. We analysis their results and summarize their methods in the paper. Since all these teams submitted their source code in the challenge, a detailed ablation study is also conducted to verify the effectiveness of the particular modules proposed. We find many of the proposed techniques are practical for the clinical diagnosis of glaucoma. As the first in-depth study of fundus \& OCT multi-modality glaucoma grading, we believe the GAMMA Challenge will be an essential starting point for future research.

translated by 谷歌翻译

Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency Learning

Linkai Peng , Li Lin , Pujin Cheng , Ziqi Huang , Xiaoying Tang

分类：计算机视觉

2022-01-13

已经开发了各种深度学习模型，以从医学图像分段解剖结构，但它们通常在具有不同数据分布的另一个目标域上测试时具有差的性能。最近，已经提出了未经监督的域适应方法来缓解这种所谓的域移位问题，但大多数都是针对具有相对较小域移位的方案设计的，并且在遇到大域间隙时可能会失败。在本文中，我们提出DCDA，一种新的跨模型无监督域适应框架，用于具有大域移位的任务，例如，来自Octa和OCT图像的分段视网膜血管。 DCDA主要包括解开表示样式转移（DRST）模块和协作一致性学习（CCL）模块。 DRST将图像分解成内容组件和样式代码，并执行样式传输和图像重建。 CCL包含两个分段模型，一个用于源域，另一个用于目标域。这两种模型使用标记的数据（与相应的传输图像一起）进行监督学习，并在未标记的数据上执行协作一致性学习。每个模型都侧重于相应的单个域，并旨在产生专用域特定的分段模型。通过对视网膜船分割的广泛实验，我们的框架从Octa到Oct和Oct到Octa的OctA到Octa的骰子分数均达到目标培训的甲骨文，显着优于其他最先进的方法。

translated by 谷歌翻译

COROLLA: An Efficient Multi-Modality Fusion Framework with Supervised Contrastive Learning for Glaucoma Grading

Zhiyuan Cai , Li Lin , Huaqing He , Xiaoying Tang

分类：计算机视觉

2022-01-11

青光眼是可能导致盲目的眼科疾病之一，早期检测和治疗非常重要。眼底图像和光学相干性断层扫描（OCT）图像均为广泛使用的诊断青光眼的方式。然而，现有的青光眼分级方法主要利用单一的方式，忽略眼底和OCT之间的互补信息。在本文中，我们提出了一个有效的多种式监督对比的对比学习框架，名为Corolla，用于青光眼分级。通过层分割以及厚度计算和投影，从原始OCT卷中提取视网膜厚度图，并用作更换的模态，导致更有效的计算，内存使用较少。鉴于医学图像样本的高结构和分布相似之处，我们采用了监督的对比学习，以提高模型的歧视力，更好地融合。此外，对成对的眼底图像和厚度图的特征级融合以提高诊断精度。在Gamma DataSet上，与最先进的方法相比，我们的Corolla框架达到了压倒性的青光眼分级性能。

translated by 谷歌翻译

Towards Federated Learning on Time-Evolving Heterogeneous Data

Yongxin Guo , Tao Lin , Xiaoying Tang

分类：机器学习

2021-12-25

联合学习（FL）是一种新兴学习范例，可以通过确保边缘设备上的客户端数据局部性来保护隐私。由于学习系统的多样性和异质性，FL的优化在实践中具有挑战性。尽管最近的研究努力改善异构数据的优化，但时间不断变化的异构数据在现实世界方案中的影响，例如改变客户数据或在训练期间留下或离开的间歇性客户，并未得到很好地研究。在这项工作中，我们提出了持续的联邦学习（CFL），灵活的框架，以捕获FL的时间不正常性。 CFL涵盖复杂和现实的情景 - 在之前的流派中评估了挑战 - 通过提取过去的本地数据集的信息并近似当地目标函数。从理论上讲，我们证明CFL方法在时间不断发展的场景中实现了比\ FEDAVG更快的会聚率，其中益处依赖于近似质量。在一系列实验中，我们表明数值调查结果与收敛分析相匹配，CFL方法显着优于其他SOTA FL基线。

translated by 谷歌翻译

More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates

Sirui Zhao , Huaying Tang , Xinglong Mao , Shifeng Liu , Hanqing Tao , Hao Wang , Tong Xu , Enhong Chen

分类：计算机视觉

2023-01-03

As one of the most important psychic stress reactions, micro-expressions (MEs), are spontaneous and transient facial expressions that can reveal the genuine emotions of human beings. Thus, recognizing MEs (MER) automatically is becoming increasingly crucial in the field of affective computing, and provides essential technical support in lie detection, psychological analysis and other areas. However, the lack of abundant ME data seriously restricts the development of cutting-edge data-driven MER models. Despite the recent efforts of several spontaneous ME datasets to alleviate this problem, it is still a tiny amount of work. To solve the problem of ME data hunger, we construct a dynamic spontaneous ME dataset with the largest current ME data scale, called DFME (Dynamic Facial Micro-expressions), which includes 7,526 well-labeled ME videos induced by 671 participants and annotated by more than 20 annotators throughout three years. Afterwards, we adopt four classical spatiotemporal feature learning models on DFME to perform MER experiments to objectively verify the validity of DFME dataset. In addition, we explore different solutions to the class imbalance and key-frame sequence sampling problems in dynamic MER respectively on DFME, so as to provide a valuable reference for future research. The comprehensive experimental results show that our DFME dataset can facilitate the research of automatic MER, and provide a new benchmark for MER. DFME will be published via https://mea-lab-421.github.io.

translated by 谷歌翻译

MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding

Steven H. Wang , Antoine Scardigli , Leonard Tang , Wei Chen , Dimitry Levkin , Anya Chen , Spencer Ball , Thomas Woodside , Oliver Zhang , Dan Hendrycks

分类：自然语言处理

2023-01-02

Reading comprehension of legal text can be a particularly challenging task due to the length and complexity of legal clauses and a shortage of expert-annotated datasets. To address this challenge, we introduce the Merger Agreement Understanding Dataset (MAUD), an expert-annotated reading comprehension dataset based on the American Bar Association's 2021 Public Target Deal Points Study, with over 39,000 examples and over 47,000 total annotations. Our fine-tuned Transformer baselines show promising results, with models performing well above random on most questions. However, on a large subset of questions, there is still room for significant improvement. As the only expert-annotated merger agreement dataset, MAUD is valuable as a benchmark for both the legal profession and the NLP community.

translated by 谷歌翻译